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Abstract. The subject of persistent homology has vitaUzed appUcations of algebraic topology to point 
cloud data and to application fields far outside the realm of pure mathematics. The area has seen several 
fundamentally important results that are rooted in choosing a particular algebraic foundational theory to 
describe persistent homology, and applying results from that theory to prove useful and important results. 

In this survey paper, we shall examine the various choices in use, and what they allow us to prove. 
We shall also discuss the inherent differences between the choices people use, and speculate on potential 
directions of research to resolve these differences. 



To Gunnar Carlsson 



Johnstone [4^ named his book on topos theory "Sketches of an elephant" , referencmg a joke: three bhnd 
wise men encounter an elephant. They each try to describe it to each other. The wise man who caught hold 
of the elephant's trunk says "An elephant is like a snake."; the wise man holding the ear says "An elephant 
is like a palm leaf." ; and the wise man holding its leg says "An elephant is like a tree." . 

The joke is highly relevant to topos theory; which has its roots in logic, in geometry, and in topology, with 
the three perspectives being fundamentally different and enriching each other in surprising ways. 

The title of this paper is similar, but different. The platypus is well-known to be a hybrid of an animal: 
sharing traits both with the phylum of birds and with the phylum of mammals. The field of persistent 
homology is in a similar situation to the platypus: there are two different viewpoints of what persistent 
homology should be, and they interact in sometimes unexpected ways. 
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1. Introduction 



Persistent homology is a technique that has sparked the birth of a new field of research; while several 
introductory texts have been written |6| |35| [45| |60j , and several good survey articles have been published 



19 36 , most if not all target computer scientists or data scientists with an interest in topology. 



The development of persistent homology and topological data analysis has been driven by algorithm devel- 
opment. In this paper, we will try to describe the field and its development with a view towards the different 
foundational viewpoints that have been leveraged to prove increasingly valuable results in the field. 

As an alternative, this article proposes to be an introductory survey targeting mathematicians with an interest 
in the applicability, and with a specific view towards the applications of algebra in persistent homology. To 
our knowledge, there is one other article with a similar focus; the AMS Notices article by Weinberger 59 1. 

We assume that the reader is comfortable with the homology functor, basic category theory and homological 
algebra including the idea of an abelian category, and basic analysis including the idea of a Lipschitz function. 

For the remainder of the paper, we will go through the various viewpoints and their strengths in order chosen 
for. To help the reader keep the descriptions in context, the article starts, here, with a very short overview 
of the upcoming contents. 



1.1. Foundations in use. There are two main genres of foundations in use, two cultures of "persistent 
homology" . 

Filtered spaces: Persistent homology is about the effect of applying the homology functor to a filtra- 
tion of topological spaces. Invariants describing the resulting homology diagrams help us construct 
tools for visualization and data analysis eventually allowing for the inference of topological structure 
for point clouds using specific constructions of filtered complexes that encode properties of point 
clouds. 

Representations of the reals: Persistent homology is about studying sublevel sets of real-valued 
functions on topological spaces. Such sublevel sets have - for nice enough functions and spaces - 
discretizations that allow us to adapt descriptions of finite diagrams of vector spaces to efficient 
descriptors. In particular, by using the "distance from a set" family of functions we can support 
inference of topological structures from point clouds. 

Both these choices come with built in benefits as well as drawbacks. They give rise to different generalizations 
of the fundamental inference problem for point clouds sampled from a topological space, and they support 
different further constructions and proofs. 

In particular, among the results that emerge from the two viewpoints, we will be discussing a selection in 
this paper: 

Stability: The representations of the reals viewpoint allows us to prove a Lefschetz-style property 
for the inference process underlying the theory: there is a metric, the bottleneck metric, on the 
invariants of the diagrams of homology groups such that the distance between the homologies of 
the sublevel sets of two different functions is bounded by the Loo-distance between the functions. 
Evolutions in the exact definitions used for persistence lead to increasingly generous assumptions in 
this bound. 

Sub- and super- and iso-level sets: By modifying the constructions used, we can get new construc- 
tions that allow us to study sequences of super- level sets, of iso-lcvel sets (or level-sets), and of the 
result from collapsing sub- or super-level sets to a single point. In particular, this brings us ex- 
tended persistence, where no infinite length intervals occur, and a number of topological features 
comes into play, including Poincare duality. Current technologies for iso-level sets tend to rely on 
zig-zag persistent homology (see below). 

Graded modules: The kinds of diagrams emerging from the filtered spaces viewpoint have the 
structure of graded modules over the polynomial ring k[t]. This recognition sparked both new 
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algorithms for computing persistent homology with far less assumptions on the chosen coefficient 
ring, and a number of extensions of the fundamental constructions that we will mention below. 

Relax the filtration requirement: In a seminal paper, Gabriel ^43^ proved that the lameness of the 
representation theory of quiver algebras depends only on the corresponding Dynkin diagram, not 
on the particular orientation of arrows in the quiver. Re-interpreting the diagrams of vector spaces 
emerging from the filtered spaces viewpoint as modules over quiver algebras rather than modules 
over k[t] allows for inclusion maps that go both forwards and backwards producing zig-zag persis- 
tent homology, which has allowed for both a topological approach to statistical bootstrapping and 
concrete approaches to iso-level set persistent homology. 

More directions: The work by Carlsson, Ishkhanov, de Silva, and Zomorodian [sj on the topology of 
configurations of pixels in natural images relied on being able to vary several independent variables 
in the construction of the intermediate simplicial complexes studied. This inspired Carlsson and 



Zomorodian 12 to study how these multi- dimensional approaches can be handled. A straight 
generalization from graded [k[i]-modules directs us to study modules over \k[ti,t2, ■ ■ ■ ,tn], which 
brings a whole range of theoretical and computational problems. Nevertheless, recent research seems 
promising. 



15 53 



Several results make specific reference to geometric complex constructions that are in common use in persis- 
tent homology. Since the choices of algebraic foundations seldom influence these constructions specifically, 
our description shall be brief and summary. Each of them requires a point cloud L - a finite subset of some 
metric space. 

Cech complex: The Cech complex is an £-parametrized simplicial complex defined as the nerve com- 
plex of the family of open e-balls around the points of L. We write Ce(L) for the resulting filtered 
and parametrized simplicial complex. 

Vietoris-Rips complex: The Vietoris-Rips complex is the most widely used construction - it is less 
dependent on dimension constraints than the a-complex, and less computationally intensive to work 
with than the Cech complex. The Vietoris-Rips complex VRe(L) at e contains a simplex {£o, . . . ,£k) 
if for all < i < j < /fc, < e. 

a-complex: The a-complex is a powerful and very nice tool - the intersection of the Cech complex 
and the Delaunay complex on a point cloud, it comes with strong theoretical guarantees. However, 
the computational complexity of the Delaunay complex means that the a-complex is mainly of use in 
2 and 3 ambient dimensions, a-complexes were introduced by Edelsbrunner, Kirkpatrick, and Seidel 



38 in 2 dimensions and by Edelsbrunner and Miicke 41 in 3 dimensions. The study of their Betti 



numbers by Robins 54 is one of the immediate precursors to the definition of persistent homology. 



Witness complexes: Witness complexes were introduced by de Silva and Carlsson 30 as one ap- 
proach to deal with the computational complexity of persistent homology. The construction uses 
a relatively small vertex set L and an often far bigger witness set W. Given a /c-simplex a with 
vertices from L and a points w £ W we say that w is an a-witness of a if the vertices of a are all 
within dk{w) + a of where dk{w) is the distance from w and its (fc + l)th nearest neighbour in 
L. We write Wa{L, W) for this simplicial complex. Witness complexes have been further studied by 
Chazal and Oudot [18 and by Chazal, de Silva, and Oudot |17 . 



2. Persistence barcodes and diagrams 

Throughout there is an underlying ideal of what persistent homology should be computing, which the field 
as a whole agrees on: given a filtered (and parametrized) sequence of topological spaces X*, the persistent 
homology H^^^{X^) is the image of the induced map Hj{Xa) Hj{Xi,). In nice enough cases the collection 
of all such homologies has a nice algebraic description as some sort of collection of intervals, and these 
intervals with their start- and end-parameters can be used to produce diagrams that allow reasoning about 
the original spaces. 

There are two main such diagrams in use - both can be seen in Figure [2] One view is the persistence 
barcode - the sequence of interval is drawn, stacked on top of each other. Such a barcode can be seen in 
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the middle of Figure [2j The rank of any particular i/j'^^(X*) is the number of intervals in the barcode that 
entirely covers the interval (a, 6). 

The other diagram in use is the persistence diagram: the start- and end-points of an interval in the 
interval decomposition of the persistent homology are taken to be x- and y-coordinates of points in the 
upper half of the first quadrant of the plane. An example can be seen to the right of Figure [2j The number 
of points contained in the quadrant delimited by the horizontal line at height a and the vertical line at width 
b determines the rank of iJ"'''(X*). 

Either of these cases is a visualization of the underlying data of a barcode^ which we can define as Cohen- 



Steiner, Edelsbrunner, and Harer 23 as a multiset in IR . The barcode is usually taken to include the 
uncountably many points along the diagonal of as part of the barcode. 

Several metrics have been proposed for the space of all such barcodes or diagrams - most of them have 
definitions more easily handled by working with the persistence diagram definition. We shall meet a few in 
this paper. For their definitions we shall assume that X and Y are two barcodes. We write Bij(X, Y) for the 
collection of all bijections between X and Y . For up to countably many non-diagonal barcode elements, such 
bijections may pair each non-diagonal element with some possibly diagonal element, and pair all diagonal 
elements with infinitesimally close diagonal elements. 



The first two definitions here are taken from j23 . The definition of Wasserstein distance is from 24 . 

Definition 1. The bottleneck distance dsiX^Y) is defined as 

dB{X,Y)^ inf sup ||a; - 7(a;)||oo • 
7eBij(x,y)a.gx 

Definition 2. The Hausdorff distance d^iX.Y) is defined on multisets X,Y in by 
dniX, Y) = max{sup inf \\x - y||oo, sup inf \\y - x\\oc} ■ 

Definition 3. The Wasserstein distance dy^{X,Y) is defined as 

i/p 



d^^{X,Y)=(^MY,\\^-j{x)\\l)j 



For diagrams from persistent homology, where the points come in different dimension, the total Wasserstein 
distance sums the infima for each dimension separately before computing the p-th root. 



3. Functions on a manifold 

The study of persistent homology originates from Edelsbrunner, Letscher, and Zomorodian ^39, , who first 
define the term and provide an algorithm for the computation of persistent homology. Taking their inspiration 
from a-shapes, the authors assume that a filtered simplicial complex is provided as input, and produce a 



description of its persistent homology. In a slightly later paper, Edelsbrunner, Harer, and Zomorodian 37 
demonstrate that persistent homology can be applied to morse complexes from piecewise linear functions on 
a manifold - the filtered simplicial complex required is given by combining the morse complex cells with the 
function values at the critical points witnessing each cell. 

From this point and onwards, one strongly present culture in persistent homology remains focused on the role 
of a function defined on a manifold as the input data for the method. This viewpoint has proven remarkably 
fruitful in the study of stability, and provides the best tools we currently have for justifying topological 
inferences with persistent homology. 

It is worth noting that a point cloud topology point of view fits in this framework: as is illustrated in 
Figure [T] the distance to a discrete set of points produces a real- valued function on the ambient space of the 
points, with a persistent homology corresponding closely to the Cech complex homology of the point cloud 
itself. 
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Figure 1. The distance to a set of points, defined for any point as the infimum of individual 
distances to points in the point cloud, produces a function for use in the functional approach 
to persistent homology. The points at the bottom of the valleys in the graph are the points 
of a 1-dimensional point cloud; and the lightly drawn cones emanating from each point 
correspond to the distance function from that point itself. The lower envelope of these 
distances forms the distance to the entire set, thus the function for encoding Cech complex 
persistent homology as a functional persistent homology. 




Figure 2. Persistence of Hq of sublevel sets of a function IR — R. In black, we see the three 
components that appear at different times show up - in the middle in a persistence barcode, 
and to the right as the three points in a persistence diagram. In red, we indicate a particular 
choice of height e, at which the sublevel set has two components - drawn below the graph to 
the left. These two components can be read off in both persistence visualizations - through 
the two intersected bars in the middle, and through the two points contained in the shaded 
red region to the right. 

3.1. A functional view of persistent homology. With this viewpoint, the fundamental given datum is 
a geometric object X and a tame function / : X — > R. In order to study the behavior of sublevel sets of /, 
persistent homology is used to measure the filtration of X given by X^ = f^^{{—oo,e]). 

A function / : X — ?> R is called tame if it is continuous, all sublevel sets have homology groups of finite rank, 
and there are finitely many critical values where the homology groups change. 

This viewpoint, and the reasons for some of the choices made in creating algorithms are at their most 
apparent when considering the 1-dimensional case, where X = R, and we consider sublevel sets of some 
function R — > R. 

Consider Figure [2] Critical points of the function correspond to points where the sublevel set topology 
changes - at minima, a new component is born, and at maxima, two components merge. To reflect these 
correspondences, we pair up critical points, choosing to pair a maximum with the latest relevant minimum, 
to reflect that the newer connected component merges in with the older one. The red line gives an example 
of a particular choice of height; the sublevel sets are split into two components, a fact reflected in the two 
bars intersected by the red line in the barcode - the number of bars at any given parameter value reflects 
the corresponding Betti number at that stage. 

We write Dgmp(/) for the collection {{b,d)} of start and endpoints of the barcode corresponding to the pth 
persistent Betti number /3p of / : X — >• R. 
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Figure 3. Going from a function on a manifold to a filtered sequence of spaces. Vertices 
of the Morse complex are given by the local minima, and each local maximum witnesses 
an edge connecting two neighbouring minima. To the left, we see the function with critical 
points marked, in the middle the sublevel sets at these points, and to the right, the corre- 
sponding filtered Morse complex. The filtered and parametrized structure of both spaces 
and complexes is clearly visible. 

We notice that the filtered simplicial complexes described in Section [Ll] are the natural representations when 
the function studied is the distance to the sampled point cloud. 

3.2. Filtered complexes. The original persistence algorithm was formulated in terms of filtered complexes, 
and the functional view is fast to generate a filtered complex from the function under study. The key method 



to do this is described in Edelsbrunner, Harer, and Zomorodian 37 : in a Morse theory approach, cells of 
a cellular complex correspond to critical values of the function, and depending on the index of the critical 
point, we can read off the dimensionality of the cell. 

The Morse theoretic viewpoint gives a translation dictionary between critical points and cells in all dimen- 
sions, even where the example given in Figure [3] is working in just one dimension. The fundamental feature 
to pay attention to is the index of a critical point - the number of negative signs in the appropriate quadratic 
form formulation of the Hessian at the critical point - the higher the index, the higher the dimension of the 
cell corresponding to that critical point and introduced at the parameter of its function value in a sublevel 
set filtration. 

3.3. The stability meta-theorem. There are results that depend crucially on a functional viewpoint in 
order to even articulate the question much less reach an answer. Most important of these is the issue of 
stability. An introductory description would be that stability produces a continuity guarantee for the process 
that goes from a function to a barcode or persistence diagram descriptor of its persistent homology. If we 
can bound the change of a function, the resulting topological description should have bounded variation. 

Stability theorems have the following general shape 

Theorem 4 (Stability meta-theorem). For a nice enough space X and nice enough functions /, 5 : X — 
a nice enough norm of the difference f g is an upper hound to the distance between the barcodes of f and 
g in some nice enough metric. 

Most of the energy going into the study of stability has been improving these concepts of nice enough, 
with significant and useful results. The development has relied at several stages on developing appropriate 
algebraic foundations to enable better theorem statements and more generous stability results. 

3.4. Vector spaces with ordered bases. The first algebraic foundation in use was to consider the result 
and the intermediate computational stages of the persistence algorithm to be a vector space with a particular 
and ordered basis chosen. This viewpoint is implicit in Edelsbrunner, Letscher, and Zomorodian [39) , where 
it generates the first algorithm for computing persistent homology. 
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3.4.1. Persistence diagrams and stability. The work by Cohen-Steiner, Edelsbrunner, and Harer |23| proves 
the first stabihty theorem for persistent homology: for a collection of persistent homology groups (referring 
to 



39 for their definition), the authors prove; 



Theorem 5. Let X be a triangulable space with continuous tame functions f,g:'A-^ 
diagrams satisfy dB(Dgmp(/), Dgmp(5)) < ||/ - .g||oo- 



Then the persistence 



Here, tame is defined to mean that the function / has a finite number of homological critical values and that 
the homology groups of sublevel sets are all finite-dimensional. This theorem was first proven, restricted to 



p = 0, by d'Amico, Frosini, and Landi 28 , using the language of size theory. 



The results from Cohen-Steiner, Edelsbrunner, and Harer |23j have been generalized in several steps since 
its publication. Many if not most of these generalizations include a variation in the algebraic foundations to 
enable their greater power of proof. 

3.5. Diagrams over (R, <). The first paper generalizing the results in was published by Chazal, Cohen- 
Steiner, Glisse, Guibas, and Oudot [l6]. The paper defines a persistence module J" to be a diagram in the 
category Vectj^ of the shape of the total order (R, <). In other words, assigns a vector space T{x) to each 
a; € R, and a hnear map T{x < y) to each order relation x < y, making J" a functor (R, <) — >• Vectik. We 
shall refer to these persistence modules as (R, <)-modules and to the maps T{x < y) as translation maps.. 
The authors define a new tameness notion, and are able to prove an extended stability theorem. 

Definition 6. A (R, <)-module T is 5-tame if for any a < a + 5 < P the rank of F{a < /3) is finite. 

Definition 7. A function / : X — > R is said to be 5-tame if the {R, <)-module of the homologies of the 
sublevel set filtration of X generated by f is 5-tame. 

The authors also define weak and strong interleaving - concepts that will re-surface repeatedly in this 
direction of study. The articulation of the original definitions will be easier if we write X / (a) for the set 

/-H(-oo),a]). 

Definition 8. Two functions /, 5 : X — R are weakly e -interleaved for e > if there is some a € R such 
that 

X/(a + 2ne) C Xg{a + {2n + l)e) C X/(a + 2(n + l)e) 

for all n lE Z. 

Definition 9. Two functions /, g : X R are strongly e -interleaved for e > if for all a £ R and all 

n ez, 

Xf{a + 2ne) C Xg(a + (2n + l)^) C X/(a + 2(n + l)e) . 

The definitions extend directly to generic (R, <)-modules by the following definition: 
Definition 10. Let J- and Q be two (R, <) -modules. Consider the diagram 



F{a + 2ne) 



J"(a + (2n + l)e) 



g{a + 2ne) 



g{a+{2n + l)e) 



T{a + 2{n + l)e) 



g{a + 2{n 




The modules T and Q are weakly e-interleaved if there are linear maps for the diagonal arrows that make 
all these diagrams commute for some fixed a G R and all n £ Z. 

The modules J- and Q are strongly e-interleaved if there are linear maps for the diagonal arrows that make 
all these diagrams commute for all a £ R and all n £ Z. 

Finally, for a persistence diagram D, we can define the (5-persistence diagram Dg by removing any points 
within 5 of the diagonal, i.e. any point {b, d) with Q < d — b < 5. Instead of augmenting the diagram with 
the diagonal A = {{x, x) : x & R}, we augment Ds with the diagonal {(a;, x -\- 6) : x £ R}). 
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With these definitions in place, we are able to state the most important results from 16 
Proposition 11. Let /, g : X — > IR be two real-valued functions on a topological space. 

(1) If f,g are strongly e -interleaved, then they are weakly e -interleaved. 

(2) // /, g are weakly e -interleaved, then they are strongly Ze-interleaved. 

(3) /, g are strongly e-interleaved if and only if \\f — g\\oo < £• 

The last of these three parts is crucial for this approach to stability - it means that stability results can be 
translated into how persistence diagrams of interleaved modules behave rather than how persistence diagrams 
depend on functional properties. 

Write II for the (R, <)-modulc that has ll{x) = 0< if X E [a,b] and Ia{x) = otherwise, and such that all 
^{x 5: y) are the zero map unless a < x < y < b, in which case I^(a; < y) is the identity map. We call 
the interval module for the interval [a,b]. Similar definitions can be produced for (a, 6], [a,b) and (a, 6). 
Some (R, <)-niodules decompose into a direct sum of interval modules. 

For an (R, <)-module J- that does decompose into an interval module, we write Dgm(J-') for the multiset of 
endpoints of intervals in a decomposition in such modules. Since the interval modules are indecomposables in 
the category of (R, <)-modules, it follows from the KruU-Schmidt-Azumaya-theorem that the decomposition, 
and therefore this diagram, is unique if it exists. 

Theorem 12. Suppose J- and Q are weakly e-interleaved (R, <)-modules that both are S-tame for some S > 0. 
Then 

dB{Bgm{J^)s,Bgm{g)s) < 3e 

Theorem 13. Suppose J- and Q are strongly e-interleaved {R, <) -modules that both are S-tame for some 
6>0. Then 

dH{BgmiJ^)s,Dgm{g)s) < 3e 

In particular it follows that 

Theorem 14. // /, g are two S-tame functions such that \\g — /||oo l£ ^ for S > and £ > 0, then for any p, 

dB(Dgnipif)s,'Dgmp{g)s) < e 



It is worth noticing that for Theorem 14 the assumptions on triangulability for X and on continuity for /, g 
from Theorem [5] have been removed. 

Theorem 15. Suppose L is a finite point cloud in some metric space. There are {R,<)-modules 

HpC{L){a) = Hp{C2aiL)) 
HpVR{L)ia) ^ HpiVR24L)) 
HpWiL,W){a) = Hp{W2AL,W)) 
with all the translation maps induced from the inclusion maps. 
Then 

ds(Dgm(i/p C{L)),Bgm{Hp VR(i))) < 1 . 

If the points of L are densely sampled from a compact set L C C X C R'', with sampling conditions stated 
in IT8[ Theorem 3.7], 

dB(Dgm(ffp CiL)), Dgm{HpWiL, W))) < 3 . 

3.6. Directly to diagrams. Some of the stability results skip the intermediate step of persistence modules 
altogether, and argue entirely in terms of the persistence diagram and its behaviour. As far as we have been 
able to tell, this operates with an underlying assumption of using (R, <)-modules as an algebraic framework, 
but some papers never articulate this choice concretely. 



Cohen-Steiner, Edelsbrunner, Harer, and Mileyko 24 prove 
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Theorem 16. Let be a triangulable compact metric space implying bounded degree k total persistence, for 
k <1, and Zef /, 5 : X — > IR be two tame Lipschitz functions. Then 

for all p > k, where C — Cx niax{Lip(/)*'', Lip(g)'''}. 

Here, a metric space X implies bounded degree k total persistence if there is some Cx depending only on X 
such that PerSfc(/) < Cx where Df is the persistence diagram of a sublevel set filtration of / : X — >■ R with 
Lipschitz constant Lip(/) < 1 and Pers/c(/) — J2{b d)eDf b-d>t(^ ~ '^)'^- '^^^^ ^ Lipschitz function tame 
if the homologies of the sublevel sets come with finitely many changes and each homology group has finite 
rank. 

With the same notation, there is a stability theorem for the total persistence moments PerSp(/) too. We 
write Amp(/) = max^-gx f{x) - min^gx f{y)- 

Theorem 17. Let be a triangulable, compact metric space that implies bounded degree-k total persistence 
for k > 0, and /ef /, 5 : X — )■ IR be two tame Lipschitz functions. Then 

I Persp(/) - Pcrsp(g)| < Apw^-^-'^C ■ \\f - g\\o. 

for every p > k+1, where C = Cx max{Lip(/)'^, Lip((7)'^} andw is bounded from above 61/ max{Amp(/), Amp(g)}. 

3.7. Measures on the real line. The approach in Section [33] was elaborated by Chazal, Silva, Glisse, and 



Oudot 19 . In that paper, the authors deal primarily with the fundamental question of which conditions on 
a total order module allow for a persistence diagrams decomposition to even exist. For cases where these 
decompositions do exist, they are able to prove stability theorems; and in order to establish this existence, 
they develop a fruitful notation and viewpoint. 

Their approach continues with the emphasis on the behaviour of persistence diagrams that we saw in Sec- 
tion |3.6| There are theorems in their work that relates the work to the behaviour of specific parametrized 
filtrations on concrete spaces, but most of the work considers persistence diagrams of abstracted and decom- 
posable (R, <)-modules with specific tameness conditions directly. 

3.7.1. Persistence measures. At the core of this approach is the recognition that multisets of points in the 
plane correspond precisely to locally finite integer-valued measures on the plane, that can then be considered 
to be counting the points as point masses. To elaborate, the authors consider four types of persistence 
intervals: [a, 6], [a, &), (a, b], (a, b). To acquire a coherent notation for these, they introduce point decorations 
- can be thought of a -|- e for some infinitesimal e, so that an interval starting at a+ is open at that end, 
and an interval ending in a+ is closed at that end. Similarily, b~ can be thought of as b — e for an infinitesimal 
e, so that an interval starting in b~ is closed, and an interval ending in b" is open. Following 19 , we write 
a* when we do not have any information about the decoration of a. 

Viewing an interval (a* ,b*) as a point in a persistence diagram - viewed as a multiset in the plane - 
the point is some (a, 6) decorated with a flag pointing in one of the quadrant directions: ++,H — , — h, or 

. Translation from a persistence diagram to a persistence measure now follows easily: for a rectangle 

(a, b) X (c, d) in the plane (with undecorated endpoints), a point is counted by the measure if the point flag 
points into the interior of the rectangle. With this definition, additivity for the measure function can be 
proven, and a whole slew of measure theoretic machinery can be used. 



Far from all possible (R, <)-modules are decomposable into interval modules - de Silva 29 gives as examples 
of decomposable total order modules the classes 

(1) Modules over finite orders (proven by Gabriel ^43, ) . 

(2) Modules over (Z, <) of locally finite dimension (proven by Webb [58]). 

(3) Modules over (R, <) of locally finite dimension (proven by Crawley-Boevey [27]). 

but also points out that Webb 58 demonstrates a module M over (R, <) where each M{x < y) has finite 
rank, but still the module is not decomposable into intervals. 
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Figure 4. Tamcness-conditions schematically illustrated. 

The measure approach, however, provides a decomposition into interval modules whenever one actually 
exists, and if no global decomposition exists, can identify regions of the plane where the persistence diagram 
is decomposable, and provide a decomposition over these regions. 

For an arbitrary representation of (R, <), the authors are able to prove a measure /i that coincides with 
the point mass perspective: for a persistence module M such that every M{x < y) has finite rank r^, the 
measure //([a, h] x [c, d]) is defined to be — '"a ~ '"b + '"a- This way, the measure /i is defined for arbitrary 
modules with finite rank translation maps. 

3.7.2. Tameness conditions. Based on these decomposability regions, the authors are able to define a family 
of tameness conditions, with inclusions of classes of modules along the arrows: 



v-tame 



finite > locally finite > q-tame 




r-tame 



h-tame 



Here, a module M is... 

finite: if M is a finite direct sum of interval modules. 

locally finite: if M is a direct sum of interval modules, such that only finitely many span any given 
t e R. 

q-tame: if the measure corresponding to AI is finite over every quadrant not touching the diagonal, 
h-tame: if the measure corresponding to M is finite over every horizontally infinite strip H not touching 
the diagonal. 

v-tame: if the measure corresponding to M is finite over every vertically infinite strip V not touching 
the diagonal. 

r-tame: if the measure corresponding to M is finite over every finite rectangle not touching the diag- 
onal. 

These four last cases are sketched out in Figure [4] the horizontal bars above and to the left correspond to 
interval modules that survive until +oo and interval modules that were born at — oo respectively. 

3.7.3. Order module view of interleaving. The plane has a partial order given by (pi, qi) < (p2, (72) if and 
only if pi < qi and p2 < <Z2- We can define the shifted diagonals Aj, — {{p^q)\q — p — 2x} as subsets of 
the plane; with order structure inherited from this order on the plane. These diagonals are isomorphic - as 
posets - to (R, <): by picking i 1— >■ (t — x,t + x), this isomorphism is canonical. 
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With this structure, the authors define (strong) \y — x| -interleaving of persistence modules M, N as the 
existence of a persistence module / over A^^ U Aj^ such that /|a^ — M and /|a — N. The authors prove 
that if M, N are ^-interleaved, then there is a family of persistence modules Px such that Pq = M, Pg = N, 
and Px,Py are \y — a; [-interleaved for all x,y G [0,5]. These can be fused together into a single persistence 
module over the diagonal strip {{p,q) : < g — p < 26} with the above partial order structure. 

From this interleaving relation, the authors define an interleaving distance: 

di{M,N) = mi{5 : M,N are ^-interleaved} 

This distance is a pseudo-metric: the authors prove the triangle inequality, but give as an example the four 
interval modules for the intervals {p~ ,q~), (p"*", q~), {p~ , q'^), (p"*", 9''"), that all have interleaving distance, 
but are not in fact isomorphic. 



3.7.4. Stability. Based on this machinery, the authors are able to prove a number of stability-related theo- 



rems, that all lead to the fundamental isometry theorem, occuring in 19 Theorem 4.11], and also proven 
independently by Lesnick [49) : 

Theorem 18 (Isometry). Let M,N be q-tame persistence modules. Then di{AI, N) — (ib(Dgm(A/), Dgm(A^)). 

The applications of the isometry come from identifying tameness conditions for classes of persistent homology 
modules: 

Theorem 19 (Theorem 2.23 of fl9^). Let X be a locally compact polyhedron, and / : X — > R a proper 
continuous function. Then the persistent homology of the sublevel set filtration of {X, f) is h-tame, v-tame, 
and r-tame, but not q-tame. 

Notice that the collection of tameness conditions that hold here mean that as long as we ignore any parts of 
the persistent homology of the sublevel set filtration that persists all the way from — oo to oo, the remainder 
is tame enough for the isometry theorem, and therefore stable. 



Theorem 20 (Proposition 5.1 of 17 ). // (X, dx) is a precompact metric space, then the Cech and Vietoris- 



Rips persistent homology modules are q-tame. 

It is also well-known in the community that strong finiteness conditions, and therefore also q-tameness, hold 
for the homologies of sublevel filtrations if 

• X \i a. compact manifold and / is a Morse function. 

• X is a compact polyhedron and / is piecewise linear. 

From the isometry theorem also follows, by the view of interleaving as a persistence module, the classical sta- 



bility theorem of Cohcn-Steiner, Edelsbrunner, and Harer [23] as we have already discussed in Section 3.4.1 
Theorem [Sj 

3.8. Categorification. Work by Bubenik and Scott [2] studies the category of functors (R, <) — > Vectik, 
and is able to prove that the category of persistence modules is abelian. 

They leverage this to prove a generous stability theorem: for arbitrary (not necessarily continuous) functions 
X — > IR from a topological space, and any functor H from topological spaces to a category of real-indexed 
diagrams in an abelian category 2?, the interleaving distance between the diagrams generated by applying H 
to the sublevel set filtrations of the functions is bounded above by the Loo-distance of the functions. 



Furthermore, they prove many of the categories that emerge naturally in persistent homology are abelian. 
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4. Filtered topological spaces 



The other cuhure present in the study of persistent homology focuses on the role of a filtered topological space 
and derived algebraic objects as the fundamental notion. This viewpoint has sparked a wealth of algebraic 
abstractions and given rise to several different notions of the shape of a persistent homology theory. 

Connecting this viewpoint with the original study by Edclsbrunner, Letscher, and Zomorodian [39| , and 
indeed with the entire viewpoint present in Section [3j one may point out that for any function / : X — >■ R, 
the sublevel sets /~^((— oo, x]) form a filtration of X. For tame enough - finitely many topological critical 
points, finite rank homology for any sublevel set, and similar conditions - functions, this filtration can be 
described by a finite filtration, or even a parametrization with finitely many different states. 

Since homology is a functor, and inclusions are continuous maps, applying homology to a filtration produces 
a diagram of homology groups on the shape 

iJjXo ^ iJjXi ^ > HjX„ 

and by interpreting this diagram as a module in one of a number of different possible module categories, 
further generalizations are possible. 

Commonly, the geometric filtrations in use in persistent homology really are parametrizations - for any value 
e G K, there is some resulting space X^ - that happen to generate filtrations: 

^(-oo,e] = U "^"^ X(_oo,£] C X(_oo,£'] 

For these cases, it is common to blur the lines between the definitions of filtered spaces and parametrized 
spaces. 



4.1. Vector space with ordered basis. If a simplicial complex is filtered, then this induces a preorder 
on simplices of the simplicial complex - any simplex precedes all simplices from later filtration stages. Any 
preorder can be specialized to a total order by picking arbitrarily some ordering of elements that do not 
already have an ordering setup by the preorder - and this is certainly the case with the preorder from a 
filtration. This total order can even be picked to be compatible with the coface relation on simplices. 

In particular this means that from a filtered simplicial complex, we can easily construct a chain complex with 



a totally ordered simplex basis. This was the basis of the original algorithm in 39 : simplices are consumed 
from a totally ordered stream, and the change in topology resulting from the inclusion of any one simplex is 
reflected in a changing state, from which barcodes can be read off. 



This setting has also informed extensions to the work by Edelsbrunner, Letscher, and Zomorodian 39 : in 
a paper by Cohen-Steiner, Edelsbrunner, and Harer |22| , a total ordering of the simplices in a simplicial 
complex K is used to filter K both by taking initial sequences Kj — (cfq, . . . , CTj) and by taking terminal 
sequences Lj — (dj, . . . , ctat) of the simplices. With these building blocks, then, the original persistent 
homology sequence 

H^Ko)^ >H4Kn) 

can be extended by taking homology relative to terminal sequences to produce an extended persistence 

sequence 

H.4K0) ^ • • • H^iKN) ^H4Kn,Lo)^---^ H,{Kn,Lm) 

This sequence, motivated by Poincare and Lefschetz duality, carries a number of benefits over the original 
persistent homology theory. One of them is that no infinite barcodes occur - any interval will have an 
endpoint, possibly among the relative homology groups. 

Duality produces numerous symmetry relations in the persistence diagram for extended persistence. The 
paper [22| describes a way to draw the persistence diagram for the extended case so that the symmetries 
emerge as mirror symmetries in the diagram - with some adjustments for dimension shifts in the duality 
results. 
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For the case where the ordering of cells comes from piecewise linear functions on a simplicial complex, 
extended persistence has a stability theorem bounding the bottleneck distance of diagrams by the Loo -norm 
of the difference between the corresponding functions. See Section |3.3| for more details on stability. 



Quite some research has gone into optimizing the persistence algorithm in various ways. Here, handling the 
sorted boundary matrix tends to be at the center of attention. Some notable results include: 

Cohen-Steiner, Edelsbrunner, and Morozov [26] demonstrate how the change in the persistence diagram 
induced by the re-ordering of simplices in the filtration can be traced in linear time: the result is a vineyard, 
tracing the change in the persistence diagram induced by a homotopy between functions inducing filtrations 
of a simplicial complex. This approach allows the proof of a combinatorial stability theorem, bounding 
bottleneck distance by the Loo-distance between simplicial approximations of continuous functions on a 
simplicial complex. 



Cohen-Steiner, Edelsbrunner, Harer, and Morozov 25 study a functional persistence situation where / 



X — > R, 5 : Y — > R, and Y C X. In this case, there is an induced map from the homology of the sublevel sets 
of g to the homology of the sublevel sets of /, and the authors give algorithms for computing kernels, images, 
and cokernels of this induced map. Their algorithms fundamentally work with adjusting sorted matrices of 
simplices. 



Chen and Kerber 20 work with Monte Carlo algorithms for estimating the rank of a matrix to compute 



persistence barcodes from a sorted boundary matrix. This approach speeds up the computation of barcodes. 



Chen and Kerber 21 notice that since the persistence algorithm induces a pairing between columns in 
the boundary matrix, pairing up a completely emptied out column with one that corresponds to the last 
simplex to bound the cycle, a re-ordering of the computation can eliminate many matrix operations. Their 
approach wcannot be modified to output concrete representative cycles, but improves asymptotic bounds for 
the problem of computing a barcode. 



4.2. Graded modules over k[t]. The first significant advance in the choice of underlying algebraic structure 
for persistence modules came from Zomorodian and Carlsson [61 . They observe that a diagram of vector 
spaces 

Vq-^Vi^ ... 

can be modelled as a graded module over the polynomial ring k[t]. The module V^, is taken to have Vd in 
degree d, and the action of multiplying by t corresponds to the linear map Vd — > Vd+i. Seeing as homology 
groups with field coefficients are vector spaces, and the induced maps from the inclusions in the filtration 
are linear, this construction translates a persistent homology diagram to a graded module. 

At this stage, Zomorodian and Carlsson '6T| observe that the existence of a barcode decomposition follows 
directly from the fact that k[t] is a principal ideal domain, and therefore any module V^, decomposes into a 
direct sum of cyclic modules. These come in two versions: torsion modules isomorphic to [k[i]/(t'*) for some 
natural number d, and free modules isomorphic to k[t]. These two classes can be directly translated into free 
and finite intervals [a, a + d) or [a, oo). 



The work in 
Zomorodian 



61 



39 



also demonstrates that the persistence algorithm described by Edelsbrunner, Letscher, and 
works with the same result for arbitrary field coefficients where the original description 



required coefficients in the field Z/2Z. 



This work has been extensively cited - to the point where the papers 39 61 are the standard reference 
citations for the persistence algorithm, and a number of extensions to the results have been provided, as well 
as numerous applications to the extension of expressive power the change of fields produces. 



4.2.1. Results relying on non-binary fields. The most obvious direct usefulness of the graded polynomial ring 
module approach has been in cases where the dependency of homology on the characteristic of the coefficients 
matters. This was the case in work by Carlsson, Ishkhanov, De Silva, and Zomorodian |j9j. 
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A study by Lee, Pedersen, and Mumford 48 investigates the statistics of 3 x 3 pixel patches from naturally 
occuring images. They find, inter alia, a high density circle in the first few PCA coordinates. This circle, 
they notice, corresponds closely to linear gradient directions within the dataset. 

Carlsson, Ishkhanov, De Silva, and Zomorodian [o] pick up the same dataset, and study it using persistent 
homology. They are able to recover two additional, secondary, high-density circle shapes within the dataset. 
These three circles combine to form a high-density 2-dimensional surface, which after computing persistent 
homology over both Z/2Z and over Z/3Z could be identified as the Klein bottle. 

The ability to compute persistent homology with coefficients in Z/3Z was crucial for this approach, and 
algorithmically dependent on the graded module over k[t] approach to persistent homology. 



4.2.2. Multi- dimensional persistence. With inspiration from the several relevant parameters affecting the 
analysis in j9j, Carlsson and Zomorodian fT2^ constructed multidimensional persistence. The underlying 
observation is that just as graded modules over k[t\ model singly parametrized topological spaces, adding 
more parameters corresponds to adding more variables to the polynomial ring. Hence, a d-dimensional 
parametrization can be modeled in a persistence way by working with graded modules over k[ti, . . . ,td\. 

The multi-dimensional theory has problems - chief among which is the lack of as useful a decomposition into 
a small and easy to describe class of indecomposables. The category of graded modules over k[ti, . . . ,td] has 
no complete discrete invariant, but Carlsson and Zomorodian [12^ propose a discrete invariant ~ the rank 
invariant - turning out to be incomplete but useful. 

The theory has been further studied since: 



Carlsson, Singh, and Zomorodian 11 introduce Grobner basis methods for computing multidimensional 
persistent homology, demonstrating that for one- critical multifiltrations, the rank invariant can be computed 
in polynomial time. The translation process they use to recast the problem to a Grobner basis computation 



has potential exponential blowup behaviours for the general case. Patriarca, Scolamiero, and Vaccarino 53 
demonstrate that by avoiding the mapping telescope and using more refined Grobner basis approaches, the 
computation can be bounded to polynomial time in general. 

The multidimensional approach has received a lot of attention from the Italian size function community, [l] 
[5] [13 14 treat multidimensional persistent homology in a size function framework as important tools for 



image analysis. 

Questions of stability for persistence modules have been studied, both in the size function community ([13[ 



[14]) and in the context of persistent homology by Lesnick 49 . 



4.2.3. Cohomology and duality. Persistent cohomology was mentioned by Cohen-Steiner, Edelsbrunner, and 
Harer ^22 , who immediately use Lefschetz duality to transform it into relative homology, de Silva and 
Vejdemo- Johansson [32^ , later extended by Morozov, de Silva, and Vej demo- Johansson [51 , produce an 
algorithm for computing persistent cohomology and observe connections to computing intrinsic circle- valued 
coordinate functions from point cloud datasets. 

This work inspired a paper by de Silva, Morozov, and Vejdemo- Johansson 131' in which two duality functors - 
M» hom[|<(M*, k) and h.omm^{M^, k[t]) on graded k[t]-modules are studied, and how these functors 

affect both the persistence algorithm itself, the ordering of basis elements in a sorted vector space approach, 
and how the barcodes are modified. These two functors allow the transport of information between relative 
and absolute versions of persistent homology and cohomology. 



4.2.4. Algebraic adaptation of topological constructions. In ongoing work, Lipsky, Morozov, Skraba, and 
Vejdemo- Johansson [SOj work out algorithms and approaches for using spectral sequences of graded k[t]- 
modules to parallelize the computation of persistent homology. The approach fundamentally relies on the 
algebra of graded k[i]-modules as a proxy for persistent homology. 
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From this work, Skraba and Vejdemo- Johansson 55 work out more detailed algorithmics for graded 



modules and are able to generalize the results from Cohen-Steiner, Edelsbrunner, Harer, and Morozov [25] 
to allow computation of images, kernels, and cokernels of a wider range of maps in persistent homology. 

4.3. Modules over a quiver algebra. Another algebraic model that describes the persistent homology 
diagrams of vector spaces is given by quiver algebras. A persistent homology diagram of the shape 

can be considered as a module over the path algebra kQ for Q the quiver 



A theorem by Gabriel [43] states: 

Theorem 21 (Gabriel's theorem). Ein Kocher K hat genau dann nur endlich viele Isomorphieklassen von 
unzerlegbaren endlichdimensionalen k-linearen Darstellungen, wenn K eine "disjunkte Vereinigung" endlich 
vieler Kocher der Klassen A^., oder ist, e>\,m>A, Q<n<^. 

A quiver K has finitely many isomorphism classes of irreducible finite dimensional \k-linear representations 
if and only if K is a disjoint union of finitely many quivers of the classes A^, Dm, or En for e > 1, m > 4, 
6 < n < 8. (translation: Mikael Vejdemo- Johansson) 

In particular, Gabriel goes on to prove that the exact isomorphism classes that show up for the quivers of type 
Ag - quivers of linear sequences of arrows, possibly alternating in direction - are the interval modules. These 
have some connected interval along the linear sequence where one-dimensional vector spaces are connected 
by identity maps - and outside this interval, all maps are and all vector spaces are zeros. 

For the case of "classical" persistent homology, this recovers the barcode description for the case of a filtered 
finite simplicial complex: the persistent homology decomposes into a direct sum of irreducibles, and these 
irreducibles all are these interval modules. To describe each interval module, it is enough to state its start 
and end index, which is the exact data that a barcode conveys. 

This approach has given rise to two generalization directions in particular. 



4.3.1. Zigzag persistence. Carlsson and Silva 10 pointed out that Gabriel's theorem has concrete conse- 
quences for topological data analysis. In particular, the non-dependency on arrow direction for a quiver 
to qualify as having type A,, means that we can consider quivers where arrows alternate direction, either 
occasionally or consistently. 

This paper introduces the fundamental idea, provides matrix algorithms for computing zigzag persistence, 
and provides the diamond principle^ relating how local changes along the zigzag reflect in changes to the 
persistence diagram. The paper also suggests several applications where the zigzag naturally arises: 

Balancing different parameters: In the study by Carlsson, Ishkhanov, De Silva, and Zomorodian 
[9] , the p% densest points as computed with a parametrized density estimator were used to determine 
the topology of the dataset. For studies like this one, it is worth while to try to work with all possible 
values of the parameter determining the density estimator at once - to replicate the success persistent 
homology has in sweeping over entire ranges for a parameter. 

Writing for the densest p% of the point cloud X as measured using the parameter r. Varying 
r along ri < r2 < • • ■ < rjq , there is a zigzag 

xp^yjxp^ ^?,uxp ^?3uxp ... ^.ViUX.^„ 

X?, XP XP^_^ XP^ 



For each point cloud in this sequence, compute a geometric complex, and compute its homology - 
the resulting diagram is a zigzag diagram, and its decomposition into barcodes carries information 
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about the variation of r in a way directly analogous to how persistent homology itself measures 
homological features over varying values for a parametrization. 
Topological bootstrapping: Similar to bootstrapping in statistics, one may want to take a sequence 
of small samples X, from a large dataset X and estimate the topology of each X^ individually. 
Doing this, disambiguation between local features of each X^ and global features of X is not entirely 
transparent. 

Here, the union zigzag provides a method for persisting features across several samples: 
Xi U X2 Xi U ^3 X^ U ^4 . . . X]ss—\ U X]si 

X\ X2 X^ X4 Xn^i Xn 



Features that are local to any one of the point clouds will not persist along the zigzag, while global 
features will be carried along the zigzag to long barcodes. 

This approach was further studied for practical aspects by Tausz and Carlsson [56j, who give 
concrete algorithms for the computation of the union zigzag, and demonstrate the computational 
behaviour on a number of concrete examples, including the images dataset studied in fo'. 

The union zigzag was also further applied to dynamic network analysis by Gamble, Chintakunta, 



and Krim 44 



Levelset zigzag: Given a space X and a continuous function / : X — R, the levelset zigzag would 
relate the levelsets of / through a zigzag, introduced by Carlsson, de Silva, and Morozov |7j: 

r'([si,S2]) r\[s2,S,]) ... /-1([S„-1,S„]) 

^ ^ ^ ^ y ^ ^ ^ 

rHsi) rHs2) r\s3) f-\s^.,) f-\s^) 



where aj are the critical values of /, and Sj are picked to satisfy: 

—00 < sq < ai < si < a2 < ■ ■ ■ < s„_i < a„ < s„ < 00 
This zigzag produces a computational approach to the interval persistence introduced by Dey and 



Wenger 33 



Carlsson, de Silva, and Morozov [t] also elaborate the diamond principle to connect it with the Mayer- 
Vietoris long exact sequence, and give a concrete graphical language for modifying barcodes between union 
and intersection zigzag sequences. This Mayer- Vietoris relation produces a large diagram from the levelset 
zigzag (see description below) introduced in the paper that connects to extended persistence, and admits a 
stability theorem. 



4.3.2. Circular persistence. In a sequence of preprints, Burghelea and Dey [3], Burghelea, Dey, and Dong 
[4] study what they call persistence for circle valued maps. This treats the question of how to adapt the 
methods of persistent homology in order to deal with studying maps / : X — )• S*^ instead of / : X R. Such 
maps appear naturally when studying cohomology, a fact also underlying the work by Morozov, de Silva, 
and Vejdemo-Johansson [5l] that we described in Section 4.2.3 



The authors show that by discretizing the map / : X — S"^ on its critical points, as is done in the real-valued 
case too, the resulting diagram of homology groups takes the shape of a cyclic quiver: write G2m for a 
directed graph with 2m vertices whose underlying undirected graph is the cycle C'2m- Then G2m forms a 
quiver, whose path algebra has representations of the right shape to describe circular persistence. 



Drawing on results by Donovan and Freislich 34 and by Nazarova [52], demonstrating that these quivers 
have indecomposables classified by barcode spirals coupled with Jordan cells, Burghelea and Dey ,3 produces 
algorithms and methods to both compute these indecomposable descriptions, and to solve numerous Betti 
number computation problems with the spiral and Jordan cell description of a circular persistent homology 
module. 
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4.4. Diagrams of vector spaces over order categories. The approach that shaped Section [3| studying 
persistent homology by studying categories of modules over (R, <) is an approach that leads to fruitful 
approaches to the filtration-based view as well. 



Bubenik and Scott have results on categorification of persistent homology, see Section 3.8 Their ap- 
proaches, while focused on finite type diagrams over (K, <) seem to be applicable to more general categories 
of diagrams of vector spaces. With an adapted notion of interleaving distance, the applicability to (N,<)- 
diagrams and thereby to arbitrary filtrations should be immediate. 

A recent preprint by Vejdemo-Johansson [57^ works out a slightly weakened form of categorical equivalence 
that relates tame and lower bounded diagrams over (R, <) to tame diagrams over (N, <), thus providing an 
approach to comparing on a categorical level the different approaches produced by considering filtrations or 
by considering functions on a manifold. 

Another approach that fundamentally relies on order categories can be found from Chacholski, Scolamiero, 



and Vaccarino 15 . The authors approach multi-dimensional persistence (see Section 4.2.2) by modeling the 
persistence modules as diagrams over (N'',<), where < is the partial order on N'' induced by coefficient- 
wise comparison. They give a concrete and more importantly local algorithm for computing the family of 



invariants described by Carlsson and Zomorodian 12 , thus approaching a more practical and algorithmic 
approach to persistent homology. 

One more result that emerges from a diagrams of vector spaces approach, and that is important to mention 
in this paper is from Ellis and King |42) . The authors consider five important filtrations of finite groups by 
normal subgroups - the lower central series, the lower p-central series, the derived series, the upper central 
series, and the upper p-central series. For each of these cases, the group cohomology modules of each group 
in the filtration combine into a persistent group cohomology diagram which works as a group invariant with 
noteworthy discriminatory strength between groups. 



5. Shapes of theories, future directions 

A dichotomy such as the one we have seen above cries out for a unifying theory - everyone start out with 
the same underlying problem, and believe they do approximately the same thing, there should be a way to 
treat all the algebraic foundations in use as aspects of the same underlying theory. While such a unification 
is not published as this paper is finalized, there are several ongoing efforts in the community that may well 
lead towards a unifying theory of persistent homology. 

The following descriptions are speculative in nature, describing ongoing work and possible trends, and is 
fundamentally based on personal communications with Gunnar Carlsson, Justin Curry, Robert Christ, David 
Lipsky, Amit Patel, and Primoz Skraba. 

With the plethora of differently shaped theories that we have described in Sections |4.2.2[ |4.3.1[ |4.3.2[ and 
|4.4[ a good unification that helps the field forwards will have to deal with the fact that persistent homology 
is not done with a uni-directional linear progression of some parameter. Instead, a unification will have to 
systemize handling of differing shapes of the theory. 

Once we can accomodate quiver-based shapes, as in zigzag persistence, alongside both continuous and discrete 
shapes, as with the difference between (R, <)-modules and IkQ-modules for an ^g-quiver Q, the step is not 
far to start considering tree-like, graph-like, or arbitrary topological spaces describing the underlying shape 
of a persistence theory. 

The group at the University of Pennsylvania, led by Robert Christ, has already been building up interest 
in using sheaves for engineering applications of topology for a while (see [46| - preprints by Justin Curry 
and by Sanjeevi Krishnan on their cosheaf and sheaf work are still pending). While the details are yet to 
be settled, a sheaf-based approach looks promising both for unifying persistent homology and for providing 
new techniques for applying algebraic topology. 

Amit Patel and Robert MacPherson, with input from Paul Bendich, Frederic Chazal, Herbert Edelsbrunner, 
Dmitriy Morozov, and Primoz Skraba, are working on using sheaves of well groups as a description of 
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persistent homology. Well groups quantify acceptable noise in the sublevel set approach to / : X — > R, but 
have yet to find a large range of computable situations. 

A sheaf-based approach to persistent homology using a particular topological space to describe the shape of 
the underlying theory is also part of the immediate research agenda for Mikael Vej demo- Johansson ~ classical 
persistent homology as tame (R, <)-diagrams of vector spaces would be the internal algebraic topology of 
a particular topos of sheaves, where the underlying topological space describes this particular shape of the 
theory. 



6. Conclusion 



The field of persistent homology draws from a wide range of particular choices of algebraic foundations to 
describe very similar processes under a common heading. The choices concretely enable a wide range of 
valuable results, from improved algorithms and new directions of generalization to stability and a road- map 
towards enabling statistical inference using persistence. 

The choices divide, roughly, into two classes with noticable differences - and from both directions there are 
things provable in one formalism that are all but inconceivable in the other formalism: stability results seem 
to be a very odd family of theorems to prove with a strict adherence to a filtration-based point of view, while 



the results by Ellis and King 42 are inconceivable if persistent homology can only be thought of as working 
with functions on a manifold. 

Thus, both classes are important view points that enrich the field. Hopefully, the future will bring a 
satisfactory unification of the foundational choices, demonstrating that there is a single underlying principle 
to the field. 
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